Analysis of relative influence of nodes in directed networks 
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Oh! Abstract 
O ' 

Many complex networks are described by directed links; in such networks, a link represents, for 

CZ2 \ 

■ example, the control of one node over the other node or unidirectional information flows. Some 

o ; 

°5/5 , centrality measures are used to determine the relative importance of nodes specifically in directed 

' networks. We analyze such a centrality measure called the influence. The influence represents the 

importance of nodes in various dynamics such as synchronization, evolutionary dynamics, random 

walk, and social dynamics. We analytically calculate the influence in various networks, including 

\ directed multipartite networks and a directed version of the Watts-Strogatz small-world network. The 
Os ■ 

global properties of networks such as hierarchy and position of shortcuts, rather than local properties 

[\! . 

of the nodes, such as the degree, are shown to be the chief determinants of the influence of nodes in 

OS '. 

, many cases. The developed method is also applicable to the calculation of the PageRank. We also 
. £h ' numerically show that in a coupled oscillator system, the threshold for entrainment by a pacemaker is 
5_i ■ low when the pacemaker is placed on influential nodes. For a type of random network, the analytically 



derived threshold is approximately equal to the inverse of the influence. We numerically show that 
this relationship also holds true in a random scale-free network and a neural network. 
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I. INTRODUCTION 



Networks abound in various fields; a network is a collection of nodes and links, where a 
link connects a pair of nodes. Most real-world networks are not entirely regular or random and 
have prominent properties as modeled by, for example, small-world, scale-free, hierarchical, and 
modular networks []], Q] . In such networks, some nodes are considered to be more important 
than the others. Depending on the definition of importance, various centrality measures, which 
quantify the relative importance of different nodes, have been proposed. The most frequently 
used centrality measures are perhaps the degree (i.e., the number of links owned by a node) 
and the betweenness (i.e., the normalized number of shortest paths connecting any pair of 

nnn 

nodes passing through the node in question) ]2j, [3|, |4J . New centrality measures have also been 
proposed in the field of complex networks 5|, |6| . 

Although many centrality measures are available, very few of these describe the importance 
of nodes in collective behavior of nodes on networks (see (J). In a previous study, we proposed 
a centrality measure called the influence [3]. The influence of a node denotes its importance 
in different types of dynamics. It represents the amplitude of the response of a synchronized 
network when an input is given to a certain node the fixation probability for a newly 
introduced type (e.g., new information) at a node in voter- type evolutionary dynamics {9J], the 
stationary density of a simple random walk in continuous time (^J, the so-called reproductive 
value of a node 10j, and the influence of a node in the DeGroot's model of consensus formation 



11| . It makes sense to consider the influence only in directed networks; in undirected networks, 
the influences of all the nodes take an identical value. In principle, the influence as a centrality 
measure is close to the PageRank, which was originally developed for ranking websites 131 ] . 

To assess the influence (and also the PageRank) in real complex networks, it is not sufficient 
to take into account the local property of the node, such as the degree. The global structure of 
networks such as the small- world property, modular structure, and self-similarity generally 
affects the influence values. 

In the present study, we analytically determine the influence of nodes in model networks 
such as weighted chain, directed multipartite networks with a hierarchical structure, and a 
directed version of the Watts-Strogatz small- world network Ijj]. For this purpose, we exploit 
the symmetry in networks and the relationship between the enumeration of directed spanning 
trees and the influence. We reveal the discrepancy between the actual influence and that 



2 



predicted by the mean-field approximation (MA), which takes into account only the degree. 
In fact, the nodes that occupy globally important positions in terms of influence are generally 
different from those that are locally important. The globally important nodes govern the above- 
mentioned dynamics on networks. Finally, to demonstrate the application of the influence as 
a centrality measure, we analyze a system of coupled oscillators and show that nodes with 
large influence values entrain other nodes relatively easily, i.e., with a relatively small coupling 
strength. 



II. INFLUENCE 



Consider a directed and weighted network having N nodes. The weight of the directed 
link from node i to node j is denoted by Wij. We set Wy = when the link is absent. The 
influence of node i is denoted by Vi. We define Vi as the solution for the following set of N 
linear equations: 

*= ' k * > (!<<<*). (!) 

where kf 1 = Ylj=i w ji * s the indegree of node i, and the normalization is given by YliLi v% = 1. 
When node i has many outgoing links, t\ can be large because there are many terms on the 
right-hand side of Eq. (Q. When node i has many incoming links, node i is interpreted to be 
governed by many nodes. Then, Vi can be small because of the divisive factor /cf in Eq. ([T]). 
The rationale for the definition given in Eq. (OQ) is that Vi represents the importance of nodes 
in different types of dynamics on networks, as explained in Sec. HI Values of Vi for two example 
networks are shown in Fig. [U 

Note that Vi = 1/N for any network with kf 1 = k° nt (1 < i < N), where k° nt = Ylj=i w ij 
is the outdegree. The undirected network is included in this class of networks. Therefore, the 
influence has a nontrivial meaning only in directed networks. This situation also holds true 
in the case of the PageRank; in undirected networks, the PageRank is a linear function of the 
degree of node |l3j ]. 

The MA of is given by 

1 k 1 ^ ~ k™ k^ ' 

where v = Ef=i Vj/N = 1/N. In Sees. HE and HV1 we argue that Eq. (j2J) does not satisfactorily 
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describe v^ in certain practically important types of networks, including the Watts-Strogatz 
small-world network. For these networks, we calculate the exact u$ by using different methods. 

The value of Vi can be associated with the number of directed spanning trees rooted at node 
i, as described below. Equation (TjQ) implies that Vi is the left eigenvector of the Laplacian 
matrix L, whose element is equal to La = ^f =1 j=a w ji an d Lij = —Wji (i ^ j). The 
corresponding eigenvalue is equal to 0; $2t=i = (1 < j < N). The cofactor of L is 
given as 

D{i,j) = {-1)*+* det L(i,j), (3) 

where L(i,j) is an (N — 1) x (N — 1) matrix obtained by deleting the ith row and the j'th 
column of L. Because J^Li^y = (1 < i < N), D(i,j) is independent of j. Therefore, 
considering the fact that L has eigenvalue 0, we obtain 

N N 

i=i i=i 

= detL = 0, (l<j<N). (4) 

Equation (T4|) indicates that [D (1, 1) , . . . , D (N, N)] is the left eigenvector of L with eigenvalue 
0. Therefore, we obtain 

Vi oc D (i, i) = det L (i, i) . (5) 

According to the matrix tree theorem [ljj], det L (i, i) is equal to the sum of the weights of 
all directed spanning trees of G rooted at node i. The weight of a spanning tree is defined as 
the product of the weights of the N — 1 links used in the spanning tree. Therefore, we can 
calculate Vi by enumerating the spanning trees. 

III. CALCULATION OF INFLUENCE IN SOME MODEL NETWORKS 

In this section, we analytically calculate the influence for several networks. Through these 
calculations, we show that the influence extracts the globally important nodes, which is beyond 
the scope of the MA: kf^/kf 1 . Such nodes are located upstream in the hierarchy that is defined 
by the directionality of the links or around the source of a valuable directed shortcut. 
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A. Weighted chain 



Consider a weighted chain of N nodes, as shown in Fig. 12(a). There is a link from node % 
to node % + 1 with the weight u^j+i > for each 1 < % < N — 1. There is a link from node z 
to node i — 1 with the weight iw^j-j > for each 2 < i < N. Generally, w^i+i is not equal to 
Wi + ij. There are no other links. There is only one spanning tree rooted at each node i, which 
is represented byl^2^---^z — l^i^i + l^---^N — 1 — > iV, where the arrow 
denotes either a directed link or a directed path without confusion. Therefore, 



where the normalization constant is given by 



(6) 



N 

N = /] W 2 ,lW 3t2 • • • , Wi t i-iWi t i +1 W i+ i t i + 2 ■ ■ ■ W N - ltN . (7) 
i=l 

Note that the position of the node, whether located in the middle or the periphery of the chain, 
does not affect the value of V{. 

Consider a special case where = u> 2 ,3 = ■ • ■ = Wn-i,n = 1 and w 2 ,i = w 3t2 = ■ ■ ■ = 
Wn,n-i = e [Fig. [21(b)] . For this network, we obtain 

e i_1 fl - e) 

When e is small, node % having a small % is more influential. With the normalization constant 
neglected, the MA yields k° ut /k™ = 1/e, k^/k'f = ■■■ = k^Jk^ = 1, and k^/k^ = e. 
The MA is inconsistent with Eq. (jHJ), except under the limit e — > 0, in which case Di » 1, 
v 2 ,...,v N &0. 



B. Weighted cycle 

Consider a weighted cycle having N nodes, as depicted in Fig. [2(c). The weighted cycle is 
constructed by adding two links N — > 1 and 1 — > N with the weights Wn,i and wi t N, respectively, 
to the weighted chain. 

In this network, there are N spanning trees rooted at node i, i.e., j<— j + ■••i — 1 <— 
i — > i + 1 — >•••—>• j — 1, where 1 < j < N; nodes N + 1 and are identified with nodes 1 and 
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TV, respectively. Therefore, we obtain 



Vi OC W i>i+1 W i+lji+2 ■ ■ ■ Wi_ 2 ,i-1 + W^i-iWit+i ■ ■ ■ Wi_ 3) t-2 

+ W i -. l! i- 2 Wi,i-lWi,i+l ■ ■ ■ ^i_ 4 ,i-3 H , (9) 

where Wn,n+i = Wn,i and wi )0 = Wi,jv- 

In the weighted chain, only the weights of the descending links, i.e., Wjj+i for j > i and 
for j + 1 < i, contribute to t>j. In contrast, in the weighted cycle, both Wjj + i and Wj+ij 
+ 1 i) contribute to V{. Therefore, in the weighted cycle, the effect of each link weight on 
Vi is more blurred than that in the case of the weighted chain. This property comes from the 
fact that node % and node j (i ^ j) are connected in two ways, i.e., clockwise and anticlockwise. 

As a special case, consider a directed cycle in which iu 2 ,i = 103,2 = ■ • • = wn,n-i = wi,n — 0, 
Wi t 2 = 1^2,3 = • • • = Wtf-i t N = 1, and Wn,i = e [Fig. [2](d)] . In this network, the values of the 
influence are equal to 

Vi = rrw^ip (10) 

= - = = l + { N-l)e- (U) 

The MA, which yields k\ nt /kf = 1/e, k° ut /k 2 n = ■■■ = fc^V&iV-i = 1, and k^/kft = e, is 
inconsistent with Eq. ( TTTj) except when e — ► 0. 



C. Directed multipartite network 

Consider the directed L-partite network, as schematically shown in Fig. [3](a). Layer £ (1 < 
£ < L) contains nodes. Each node in layer £ sends directed links to all the Ne + i nodes in 
layer £ + 1, where layer L + 1 is identified as layer 1. Because of symmetry, all nodes in layer £ 
have the same value of influence, denoted by vg. From Eq. ([T]), we obtain 

Ne-ive = N e+ iv e+1 , (1 < £ < L), (12) 

where N = Nl- By combining Eq. (fT2"j) with the normalization condition Ye=i ^t v £ = 1, we 
obtain 

vi = ^7 7- (13) 
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1. Super-star 



The super-star, which was introduced in 16[ to study the fixation probability in networks, is 
a variant of the directed multipartite network. The super-star shown in Fig. E^b) is generated 
as a superposition of a certain number of identical directed multipartite networks with N\ = 
N3 = 7V4 = • • • = Nl = 1 and N 2 = z {> 1). Each multipartite network is called a leave. The 
leaves are superposed such that they share a single node in layer 1. The indegree and outdegree 
of this node are equal to the number of leaves. 

It can be easily shown that Vi is independent of the number of leaves. Therefore, we consider 
the case of a single leave. Then, Eq. ( fl3l) yields 



v 1 =v, = v 5 = ... = v L = <L _ 1) + V (14) 

V2 = V *= z(L-\) + r (15) 

Surprisingly, the node in layer 1 does not have a particularly large influence value. Given z > 2, 
the nodes in the expanded layer (i.e., layer 2) and the node that receives convergent links from 
this layer (i.e., layer 3) have small influence values. These relationships are not predicted by 
the MA. The MA yields k° nt /k 2 n = k^/kfr = kg* /kg = ■■■ = kffi/k% = 1, k° ut /k™ = z, and 
k^/k™ — 1/z] the actual V\ and v 2 values are essentially smaller than the values predicted by 
the MA. 



2. Funnel 

The funnel, shown in Fig. EJ^c), was introduced in along with the super-star; it is also a 
directed multipartite network. The funnel has Nt = z L ~ l nodes in layer £ (1 < £ < L). Using 
Eq. ( ITBl . we obtain 

(*=1), 



< ZL _}' ( 16 ) 

4-^^- L " 2 , (2<£<L). 
z L — 1 

The nodes in layer L are most influential, and the node in layer 2 is least influential. The nodes 
in layer 1 are intermediately influential. For a large z, they are as influential as a node in layer 
~ L/2. These relationships are not predicted by the MA, which yields the following results: 

Lout /L.in 1 Lout /Lin Lout /Lin _,— 1 „„J Lout /Lin ~L—2 

"a / 1 J 2 I ^2 ~ ' ' ' ~ "'L-l/ ^L-l ~ z i clllu L I ^L 
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D. Directionally biased random network 



The networks considered in the previous sections have inherent global directionality due 
to the presence of asymmetrically weighted or unidirectional links from node to node or from 
layer to layer. The directionality of networks is a main cause for the deviation in the Vi values 
from those predicted by the MA. To examine the effect of directionality in further detail, we 



study the directionally biased random network 



171 ] . To generate a network from this model, we 



prepare a strongly connected directed random graph with mean indegree and mean outdegree 
z and specify a root node, which is placed in layer 1. The root node is the source of directed 
links to about z nodes, which are placed in layer 2. We align all the nodes according to their 
distance from the root. Except in the layers near the last layer, the number of nodes in layer i 
grows roughly as z . We set the weights of the forward links, i.e., links from layer I to layer 
I + 1, as unity. We set the weights of the backward links, i.e., links from layer I to layer £', 
where i > £', as e. The weights of the parallel links, i.e., those connecting two nodes in the 
same layer, are arbitrary; they do not affect the value of Vi in the following derivation. When 
e = 1, the network is an unweighted directed random graph, if the weight of the parallel link is 
equal to unity at e = 1. When e = 0, the network is purely feedforward and no longer strongly 
connected. The feedforwardness is parametrized by e. 

The directionally biased random network is approximated by using a modified tree as follows 
We assume that each node has z outgoing links and that each node except the root node 
has only one "parent" node, namely, the node in the previous layer from where it receives a 
feedforward link. Further assume that there are L layers and that layer I (1 < I < L) has 
z l ~ l nodes. The number of nodes is equal to [z L — l)/(z— 1). At this point, the constructed 
network is a tree. Then, we add backward links with weight e to this tree. When z is large, 
most backward links in the original network originate from layer L, because layer L has a 
majority of nodes. Therefore, we assume that, in the approximated network, the backward 
links with weight e originate only from the nodes in layer L. This approximation is accurate 
when z is sufficiently large. The other links in the approximated network have the weight of 
unity. For a sufficiently large z, all nodes in the same layer have almost the same connectivity 
pattern. In terms of incoming links, a node receives approximately one forward link from the 
previous layer and z backward links from layer L. The approximated network is schematically 
shown in Fig. H] On an average, each node in layer L is the source of an directed link to each 
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node with an effective weight e'. Because k° nt = ez for a node in layer L is approximated by 
e'(z L - l)/(z - 1) « e'^" 1 , we obtain e' « e2" L+2 . 

The influence of a node in layer £, denoted by V£, satisfies the following relationships: 

z^e'vx = zv 2 , (17) 
{z^e' + l)v e = zv e+1 , {2<£<L-1). (18) 

On substituting e' ~ ez~ L+2 in Eq. (1181) and considering the normalization given by 

L 

Xy-^i, ( 19 ) 

we obtain 

((^ + l)" i+1 , (* = 1), 

v e « ^ (20) 

[(e^ + ir^V^e, (2<I<L). 

For a small e, the network is close to feedforward, and V\ is relatively large; vt varies as 
V£ oc (e + z -1 )^. When z is sufficiently large, we obtain vt oc e £ , which coincides with the results 
obtained for the network shown in Fig. [21(b) (Sec. IHI All . 

To test our theory, we generate a directionally biased random network with = 5000 and 
z = 10. For e = 0.5, the values of Vi of all the nodes are plotted against the values obtained from 
the MA in Fig. Efa). Although the values obtained from the MA are strongly correlated with Vi, 
there is some variation in Vi for a fixed k° ut / kf 1 . The average and the standard deviation of Vi in 
each layer are plotted by the circles and the corresponding error bars, respectively, in Fig. H^b). 
The influence of a node decreases exponentially with I as predicted by Eq. ( f20ll [Eq. ( [20ll for 
£ > 2 is represented by the line in Fig. [5](b)]. The average and the standard deviation of Vi 
obtained by the MA are plotted by the squares and the corresponding error bars, respectively. 
The values obtained from the MA are scaled by a multiplicative factor C, where C is selected 
such that Vi = C k° nt j k™~ for the root node (i.e., £ = 1). Figure E](b) shows that fc° ut //c| n is 
generally small for node i in a downstream layer. However, the decrease in Vi with £ is much 
more than that in fc° ut //cf . The hierarchical nature of the network is revealed by the V{ values 
and not satisfactorily by the local degree. The results for e = 0.1 shown in Figs. [5](c) and[5](d) 
provide further evidence for our claim. 
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E. Small-world networks 



In this section, we analyze the influence in the directed unweighted small-world network 
model, which is a variant of the Watts-Strogatz model fiuj ]. To generate a network, we start 
with an undirected cycle of N nodes, in which each node is connected to its immediate neighbor 
on both sides. At this stage, kf 1 = kf nt = 2 is satisfied for all i. Then, we add a directed shortcut 
to the network, as schematically shown in Fig. ^a). The source and the target of the shortcut 
are denoted by nodes s and t, respectively. The distance between node s and node t along the 
cycle is assumed to be min(Ai, N2), where A2 = N — N±. 

We enumerate the number of directed spanning trees rooted at node r, which is N nodes 
away from node s along the cycle, where < N < max(Ai, A^). There are iV spanning trees 
that do not use the shortcut, as derived in Sec. IIIIBI Any spanning tree that uses the shortcut 
includes the directed path r s, which contains N + 1 nodes. The choice of the other 

links is arbitrary with the restriction that a spanning tree must be formed. The N± — N — 1 
nodes between node r and node t in Fig. [6]^a) are reached from node r or node t by a directed 
path along the cycle. There are N\ — N choices regarding the formation of this part of the 
spanning tree. The N2 — 1 nodes between node s and node t are reached from node s or node 
t by a directed path along the cycle. There are N 2 choices regarding the formation of this part 
of the spanning tree. In sum, there are N + (Ni — N)N 2 spanning trees rooted at v. Therefore, 
the influence of v is large (small) for the v that is close to the source (target) of the shortcut. 
In the region on the cycle where no source or target of a shortcut is located, the influence of 
a node changes linearly with the distance between the source of the shortcut and the node, 
because N + (Ni — N)N 2 oc —N. We call such a region, including the two border points, the 
segment . 

Next, we consider small-world networks with two directed shortcuts. There are three qualita- 
tively different possible arrangements of the shortcuts, as shown in Figs.E)^b)-[S(d). The lengths 
of the four segments are denoted by N\, N%, N3, and A/4, such that N\ + N2 + AT 3 + AT 4 = N. 

In the network shown in Fig.[6]^b), a node is located at either of the three essentially different 
positions denoted by a, b, and c. We first enumerate spanning trees rooted at node a. The 
distance from node a to the source of a shortcut, i.e., node s, is denoted by N (0 < N < Ni). 
There are N spanning trees that do not use the shortcuts. There are (Ni — N)(N 2 + N 3 + A/4) 
spanning trees that use the shortcut s — > t but not s' — > t' . There are (Ni + N2 — N) A/3 
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spanning trees that use the shortcut s' — > t' but not s — > t. There are (JVi — N)N 2 N 3 spanning 
trees that use both shortcuts, which can be explained as follows. The directed path a — > s — > s' 
is included in such a spanning tree. The Ni — N — 1 nodes between node a and node t are 
reached along the cycle from node a or node t. The A" 2 — 1 nodes between node t and node t' 
are reached along the cycle from node t or node t'. The A" 3 — 1 nodes between node t' and node 
s' are reached along the cycle from node t' or node s' . In sum, the number of spanning trees 
rooted at node a is equal to 

N + (N t - N) (N 2 + N 3 + N 4 ) + (iVi + N 2 - N)N 3 + (N t - N)N 2 N 3 , (21) 

which is proportional to the influence of node a. If N is sufficiently large and the two shortcuts 
are randomly placed, the last term in Eq. (12~TT) is of the highest order because iVj = O(N) 
(1 < i < 4). As in the case of the network with one shortcut, the influence changes linearly 
within one segment. 

Similarly, the number of spanning trees rooted at node b is equal to 

N + Nx(N 2 + N 3 + N 4 -N) + + N 2 + N)N 3 + NiN 2 N 3 , (22) 

where (0 < < A4) is the distance from node b to node s along the cycle. The number of 
spanning trees rooted at node c is equal to 

N + (N 2 - N)N 3 + N{N, (23) 

where A^ (0 < A^ < A2) is the distance from node c to node t along the cycle. Owing to 
the absence of a third-order term in Eq. ( |23l) . the influence values of the nodes located on the 
segment between the two targets of the shortcuts are very small. 

The quantities given in Eqs. (l2~TT)-(l23l) are linear in N. Therefore, the influence changes 
linearly within each segment. This is true for the other two types of arrangements of shortcuts 
shown in Figs. El^c) and[f^d). 

This linear relationship also holds true in the case of more than two shortcuts. To show this, 
we consider a general directed small-world network and focus on a segment on the cycle, which 
is schematically shown in Fig. EJ^e). Without loss of generality, we assume that a node v in the 
segment is located A^ and N\ — N nodes away from the two border points of the segment. We 
distinguish three types of spanning trees rooted at node r. First, some spanning trees include 
both r 1 and r 2. We denote the number of such spanning trees by S±. 

11 



Second, some spanning trees include r 1 and not r — > • • ■ — ► 2. For these spanning 

trees, node 2 is reached via a path r — > 1 —►••■—»• 2. To enumerate such spanning trees, 
denote by S2 the number of directed trees that span the network excluding the nodes in the 
segment between node r and node 2. Third, the other spanning trees include r 2 and 

not r 1. Denote by S3 the number of directed trees that span the network excluding 

the nodes in the segment between node r and node 1. The number of spanning trees rooted at 
node r is equal to 

S 1 + S 2 (N 1 -N) + S 3 N. 

Therefore, the influence of node r changes linearly with N (0 < N < A r 1 ). 

The thick line in Fig. [7(a) indicates the numerically obtained values of Vi for a small-world 
network with three shortcuts. We set N = 5000. The nodes are aligned according to their 
position in the cycle. In accordance with the theoretical prediction, Vi changes linearly with % 
within each segment. Vi is very small for 1250 < % < 1666, because these nodes are between 
two targets of shortcuts. 

In theory, it is assumed that each node is initially connected to only its nearest neighbors on 
the cycle (i.e., k™ = k° nt = 2). However, values of Vi are almost the same if the underlying cycle 
has k\ n = k° nt = 4 (i.e., each node is connected to two neighbors on each side) or kf 1 = k° nt = 6. 
The results for k™ = k° nt = 4 and those for fc™ = k° nt = 6 are indicated by the medium and 
thin lines, respectively, in Fig. [7J(a). The three lines are observed to almost overlap with each 
other. 

To examine the effect of shortcuts in more general small-world networks, we generate a small- 
world network by rewiring many links [jjl]. We place N = 5000 nodes on a cycle and connect 
a node to its five immediate neighbors on each side, such that kf 1 = k° ut = 10. Then, out of 
50000 directed links, we rewire 500 randomly selected ones to create directed shortcuts. The 
sources and targets of shortcuts are chosen randomly from the network with the restriction that 
self loops and multiple links must be avoided. Because of rewiring, the mean degree (k) = 10. 

As shown in Fig. [7J(b), the MA strongly disagrees with the observed js]. The values of 
Vi are plotted against the circular positions of the nodes in Fig. [7J(c). Vi changes gradually 
along the cycle, which is consistent with our analytical results. The peaks and troughs in 
Fig. E(c) correspond to the sources and targets of the shortcuts, respectively. For a node near 
a source (target), t>j is large (small), whereas the MA estimate oc fc° ut //c™ is not as affected 
by the position of the shortcuts as Uj. The relationship between v i/ik^/kf 1 ) and u$ is shown 
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in Fig. 0(d). The MA is exact along the horizontal line, i.e., v { = (k° ut / kf 1 ) / (J2f =1 k° ut /kf). 
Nodes with large (small) V{ tend to be located near sources (targets) of shortcuts. For such 
nodes, Vi is usually larger (smaller) than the value obtained by the MA. 

IV. ENTRAINMENT OF A NETWORK BY A PACEMAKER 

As an application of the influence as a centrality measure, we examine a system of coupled 



phase oscillators having a pacemaker 17J, ll8| . Consider a dynamical system of phase oscillators 
given by 



N 

UJi + 7V)Yl w ^ sin ~ ' (24) 
\ / j= i 

where the mean degree (k) = ., =1 u^y/iV provides the normalization for the coupling 
strength The phase and the intrinsic frequency of the ith oscillator are denoted by ^ 6 [0, 2-7r) 
and Ui, respectively. We assume a pacemaker, i.e., an oscillator that is not influenced by the 
other oscillators, in the network. Equation f[2"4"l) emulates a pacemaker system, where the 
pacemaker is placed at node io, if we force Wji — (1 < j < N). We examine the possibility 
of the pacemaker to entrain the other oscillators into its own intrinsic rhythm. For this, we 
assume that cjj = w (i^ io) is identical for the N — 1 oscillators and that Ui takes a different 
value. By redefining (pi — out as the new 0; and rescaling time, we set Ui = 1 and Ui = (i ^ io) 
without loss of generality. 

Depending on the network and the position of the pacemaker, there exists a critical threshold 
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18j |. When entrained, the actual 



k ct such that the entrainment is realized for k > k c 
frequency of all oscillators becomes exactly the same as that of the pacemaker, i.e., Ui = 1. 
Thus, the condition for the entrainment is given by 

N 

^ = S Wji sin ^ j - = {l<i< N,i^ io). (25) 

In general, entrainment [l^, [loj and synchrony 20] are easily realized for feedforward net- 



works. Because of the intuitive meaning of the influence, k ci may be small if Vi Q is large. We 
analytically show this for the directionally biased random network with a sufficiently large z. In 
the directionally biased random network, a node in layer i receives a forward link with weight 
unity from layer I — 1. Although a forward link is absent for a node in layer 1, this factor is 
negligible because layer 1 contains only one node. Most backward links with weight e to a node 

13 



in layer £, where £ < L — 1, originate from layer L, as discussed in Sec. IIII Dl The number of 
parallel links is smaller than that of backward links. We assume that the weight of the parallel 
link, which was assumed to be arbitrary in Sec. IIII Dl is equal to e, such that all the incoming 
links to a node in layer L, except one forward link, also have weight e. Under this condition, 
we approximate (k) ~ 1 + ez. 

Denote by nir the typical critical coupling strength when the pacemaker is located at a node 
in layer I in a directionally biased random network. First, we consider the case in which node 
2o coincides with the root node in the directionally biased random network, i.e., £ = 1. This 
case was analyzed in our previous studies 13, ll8[. In the entrained state, the phase difference 
between the oscillators in the same layer is small, and the phases of the oscillators in layers 
with small £ are more advanced. Therefore, we assume that the phases of all the oscillators in 
the same layer are identical. We set the difference between the phase of the oscillator in layer 
£ and that in layer £ + 1 to A(p£. The entrainment occurs if and only if A<f>x, . . ., A<pi_i stay 
constant in a long run. We obtain H, Q] 

A<p e = { €Z) , (1<£<L-1). (26) 

By applying the threshold condition A<fri = 1, we obtain 

«fi } = (1 + ^) L_1 ■ (27) 

Next, we consider the case in which node i$ is located in layer L. We expect ni^ to be large 

because the pacemaker is located downstream of the network. To analyze this case, we redraw 

the network as a directionally biased random network, such that node i$ is located at the root. 

Then, statistically, the network is the same as the original directionally biased random network 

in terms of the positions of the nodes and the links. However, the effective link weight in the 

redrawn network is equal to e, because most links in the original network are backward links 

with weight e. By assuming that all links in the redrawn network have weight e, the result for 

Q translates into 
L-e-i 

, (1<£<L-1), (28) 



unweighted directed random graph |1 

(l + ez)(l + z 



K€ 

where £ is the layer number in the redrawn network. Analogous to the derivation of Eq. (12 7j) 
from Eq. (|26l) . from Eq. (|28|) . we derive 

4? = (1 + "" t 1 + 2)L " 2 . (29, 
14 



When node i$ is located in the (L — M + l)th layer (2 < M < L — 1) in the original network, 
we redraw the network in a similar manner, such that node io is located at the root. The 
redrawn network is schematically shown in Fig. El For this network, we obtain 



(l+z) L - M (l+ez) M - e 



[1<£<M-1), 



^t=< . .. r (30) 

(M < £ < L — 1), 



( 1+ez )(l +z )L-t-l 



which yields 



JL-M+1) = (1 + ez) M-l (1 + z) L-M (31) 



From Eqs. flU, and ([SI]), we obtain 

= ' — " (32) 



(1 + ez) L - 1 (1 + z)^ 1 , {1<£<L-1), 



Under the condition 3> 1, Eq. (1521 gives 

/tgw^'z 1 " 1 , (1 (33) 

Moreover, Eq. f[2"U]) yields 

u< « e- x+ ^- x+1 , (1<*<L). (34) 

Therefore, we obtain 

«g> « -, (1 < £ < L). (35) 

Equation ( !35l shows that the pacemaker located at an influential node can easily realize 
the entrainment. We validate this prediction by direct numerical simulations of the pacemaker 
system on a directionally biased random network with N = 200, z = 10, and e = 0.1. To 
judge whether the entrainment has been realized for a value of k, we measure the ratio of 
£]Li,^ (< = T) - <pi (t = 0.8T)] / (TV - 1) to JO (t = T) - <j) io {t = 0.8T), where T is the 
duration of a run. The first 80% of a run is discarded as transient. The ratio represents 
the average phase shift of the oscillators, other than the pacemaker, relative to that of the 
pacemaker. If this value is more than 0.99, we consider the entrainment to be achieved. Because 
the transient is shorter for larger k, we set T = 5 x 10 5 /«;. 

The values of n cr when the pacemaker is located at different nodes are plotted against Vi in 
Fig. E](a). The line in the figure represents n cr = , i.e., Eq. fl3"o"]) . The numerically obtained 
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K cr roughly matches the theoretical one although the condition ez 3> 1 is violated. The same 
values of k cv are plotted against the MA estimate Vi (fc° ut /A;™)/ Y^j=i{k^ nt I kf) in Fig. Mjo). 
We find a larger spread of data in this plot as compared to that in Fig. Eta); Vi predicts k ci 
more accurately than the MA. 

Next, we set the weight of the parallel link to unity, as done in 13]. The values of k ct for 
this version of the directionally biased random network are shown in Figs. Etc) andEtd)- The 
results are qualitatively the same as those shown in Figs. Et a ) and 0(b). The dependence of k cv 
on Vi is weak in the new network [Fig. Et c )] as compared to the previous network [Fig. Eta)] 
mainly for the following reason. Because of the difference in the weights of parallel links, (k) 
in the new network is larger than that in the previous network. Then, nffl is smaller for the 
new network, since it is inversely proportional to the effective link weight. We expect that n cr 
for nodes in intermediate layers can also be explained using the same approach. 

The result K cr is derived for the directionally biased random network. Although 

it is not guaranteed that this relationship holds true in other types of networks, we test the 
applicability of the relation n CT m v^ 1 in a scale-free network and a neural network. 

We generate a directed scale-free network using the configuration model Ij. The degree 
distributions are independently given for k™ and k° ut by p(k in ) oc k~ Tin and p(k ont ) oc k~ lout , 
respectively. We set 7; n = 7 out = 2.5 and N = 200. The minimum degree is set to 3. The 
duration of a run and the length of the traisient are equal to those in the case of the directionally 
biased random networks. The values of K cr are plotted against Vi and the MA in Figs. Et e ) 
andE^f), respectively. The relation k ct fa v^ 1 fits the data reasonably well, even though the 
scale-free network is not a directionally biased random network. In contrast, /c° ut //c™ poorly 
predicts shown in Fig. M^)- 



We next examine the C. elegans neural network [21|, [22| based on chemical synapses, which 
serve as directed links. The network has the largest strongly connected component with N = 
237 nodes. The number of synapses from neuron % to neuron j defines Wij. We set T = 
2.5 x 10 7 /k to appropriately exclude the transient. The values of K cr are plotted against and 
the MA in Figs. Etg) andEth), respectively. The values of k ct exceeding 10 7 are not plotted 
because direct numerical simulations need too much time. Similar to the case of scale-free 
networks, Vi predicts n cr better than k^/kf 1 does. 
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V. CONCLUSIONS 



In this paper, we have analyzed the centrality measure called the influence in various net- 
works. The influence extracts the magnitude with which a node controls or impacts the entire 
network along directed links. We have analytically shown that the source of the shortcut in a 
directed version of the Watts-Strogatz small-world network and the root node in hierarchical 
networks have large influence values. This is not accurately predicted if we approximate the 
influence of a node by its degree. Although by definition, the influence is based on local connec- 
tivity, the global structure of networks does affect the influence values. We also analyzed the 
effect of the location of a pacemaker on the capability of entrainment in a system of coupled 
phase oscillators. The pacemaker located at a node with a large influence value entrains the 
other oscillators relatively easily. 

In the analysis of some model networks, including the Watts-Strogatz small-world network, 
we used the method based on the enumeration of directed spanning trees. This method can be 
applied to the estimation of the PageRank of nodes, because the PageRank can be mapped to 
the influence if we reverse links and rescale the link weight p, |9|] . Application of our results to 
other centrality measures for directed networks is warranted for future study. 

Acknowledgments 

N.M. acknowledges the support provided by the Grants- in- Aid for Scientific Research (Grants 
No. 20760258 and No. 20540382) from MEXT, Japan. 



[1] R. Albert and A.-L. Barabasi, Rev. Mod. Phys. 74, 47 (2002); M. E. J. Newman, SIAM Rev. 45, 

167 (2003); G. Caldarelli, Scale-free networks (Oxford University Press, Oxford, 2007). 
[2] S. Boccaletti, V. Latora, Y. Moreno, M. Chavez, and D.-U. Hwang, Phys. Rep. 424, 175 (2006). 
[3] L. C. Freeman, Soc. Networks 1, 215 (1979). 

[4] S. Wasserman and K. Faust, Social Network Analysis (Cambridge University Press, New York, 
1994). 

[5] J. D. Noh and H. Rieger, Phys. Rev. Lett. 92, 118701 (2004); M. E. J. Newman, Soc. Networks 
27, 39 (2005); E. Estrada and J. A. Rodriguez- Velazquez, Phys. Rev. E 71, 056103 (2005); V. 



17 



Latora and M. Marchiori, New J. Phys. 9, 188 (2007). 
[6] J. G. Restrepo, E. Ott, and B. R. Hunt, Phys. Rev. Lett. 97, 094102 (2006). 
[7] N. Masuda, Y. Kawamura, and H. Kori, New J. Phys., in press (2009). larXiv:0909.0700l 
[8] H. Kori, Y. Kawamura, H. Nakao, K. Arai, and Y. Kuramoto, Phys. Rev. E 80, 036207 (2009). 
[9] N. Masuda and H. Ohtsuki, New J. Phys. 11, 033012 (2009) 
[10] P. D. Taylor, Am. Nat. 135, 95 (1990); J. Math. Biol. 34, 654 (1996). 

[11] M. H. DeGroot, J. Am. Stat. Assoc. 69, 118 (1974); N. E. Friedkin, Am. J. Sociol. 96, 1478 
(1991). 



12] http://pajek.imfm.si/doku.php 



[13] S. Brin and L. Page, Proceedings of the Seventh International World Wide Web Conference 

(Brisbane, Australia, 14-18 April) p. 107-117 (1998); P. Berkhin, Internet Math. 2, 73 (2005). 
[14] D. J. Watts and S. H. Strogatz, Nature (London) 393, 440 (1998). 

[15] N. Biggs, Bull. London Math. Soc. 29, 641 (1997); R. P. Agaev and P. Yu. Chebotarev, Autom. 

Remote Control (Engl. Transl.) 61, 1424 (2000). 
[16] E. Lieberman, C. Hauert, and M. A. Nowak, Nature (London) 433, 312 (2005). 
[17] H. Kori and A. S. Mikhailov, Phys. Rev. E 74, 066115 (2006). 
[18] H. Kori and A. S. Mikhailov, Phys. Rev. Lett. 93, 254101 (2004). 
[19] N. Masuda and H. Kori, J. Comput. Neurosci. 22, 327 (2007). 

[20] D.-U. Hwang, M. Chavez, A. Amann, and S. Boccaletti, Phys. Rev. Lett. 94, 138701 (2005); T. 

Nishikawa and A. E. Motter, Phys. Rev. E 73, 065106(R) (2006); Physica D 224, 77 (2006); X. 

Wang, Y.-C. Lai, and C. H. Lai, Phys. Rev. E 75, 056205 (2007). 
[21] B. L. Chen, D. H. Hall, and D. B. Chklovskii, Proc. Natl. Acad. Sci. U.S.A. 103, 4723 (2006). 



22] http://www.wormatlas.org 



18 



FIG. 1: Influence of nodes in networks having N = 20. A dark node has a large value of V{. (a) Directed 
network generated by the configuration model. The indegree and outdegree follow independent power- 
law distributions with the scaling exponent 2.5 and the minimum degree 2. (b) Directed Watts- 
Strogatz network with three shortcuts. See Sees. UTTEI and ITvl for details of the network models. The 
networks are visualized by Pajek [l^ . 
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FIG. 2: Schematic of (a) weighted chain, (b) special case of weighted chain, (c) weighted cycle, and 
(d) special case of weighted cycle. 
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FIG. 3: Schematic of (a) multipartite network, (b) super-star, and (c) funnel. 
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FIG. 4: Schematic of directionally biased random network under tree approximation. 
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FIG. 5: Vi for a directionally biased random network with N = 5000. We set [(a) and (b)] e = 0.5 
and [(c) and (d)] e = 0.1. In (a) and (c), V{ is plotted against the MA results. The lines represent 
the MA: Vi = ikf/kf)/ T,f=i(kj Ut /kf). In (b) and (d), v { (circles) and rescaled kf^/kf 1 (squares) 
averaged over all the nodes in each layer are plotted against layer number. The error bars indicate 
the standard deviation obtained from Vi of all the nodes in the same layer. 
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FIG. 6: Schematics of directed small-world network having (a) one shortcut, [(b)-(d)] two shortcuts, 
and (e) general number of shortcuts. 
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FIG. 7: V{ in directed small- world networks with N = 5000. (a) Results for small- world network with 
three added shortcuts. The thick, medium, and thin lines correspond to the networks in which the 
mean degree of the underlying cycle is equal to 2, 4, and 6, respectively, (b, c, d) Results for directed 
small- world network with (k) = 10 and 500 rewired shortcuts. In (a) and (c), Vi is plotted against the 
position of the node in the underlying cycle. In (b), Vi is plotted against (k° nt /kf 1 )/ Y,j =1 (k° ut /kf); 
the line represents the MA result. In (d), the relationship between Vi / (kf ut / kf 1 ) and Vi is shown. 
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FIG. 8: Schematic of redrawn directionally biased random network when the pacemaker is located in 
layer L — M + 1 in the original directionally biased random network. 
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FIG. 9: Relationships between k ct and V{. (a, b) Directionally biased random network with N = 200 
and weight of parallel links equal to e. (c, d) Directionally biased random network with N = 200 
and weight of parallel links equal to unity, (e, f) Directed scale-free network with ./V = 200. (g, h) 
C. elegans neural network with N = 237. The data are plotted against Vi in (a, c, e, g) and against 
(k° at /kf)/^=i{kf lt /kf) in (b, d, f, h). The lines in (a, c, e, g) represent k ci rj ur 1 . 
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